European Journal of Epidemiology
○ Springer Science and Business Media LLC
Preprints posted in the last 7 days, ranked by how well they match European Journal of Epidemiology's content profile, based on 40 papers previously published here. The average preprint has a 0.03% match score for this journal, so anything above that is already an above-average fit.
Franzese, F.; Bergmann, M.; Burzynska, A.
Show abstract
Socioeconomic inequalities in health and well-being are a major public health concern, particularly in ageing populations. Education is a key determinant shaping multiple aspects of health outcomes. We used cross-sectional data from wave 9 of the German sample (n=4,148) of the Survey of Health, Ageing and Retirement in Europe (SHARE) to test whether formal education is associated with well-being in later adulthood, with health literacy, self-rated health, and preventive health behaviours as possible mediators. Our results showed that education was positively associated with greater well-being, but only via indirect pathways. Specifically, self-rated health, health literacy, and fruit and vegetable consumption mediated the relationship between education and well-being accounting for 54.7, 24.7, and 12.6 percent of the total effect, respectively. In addition, there were significant positive correlations between education and health literacy, as well as high-intensity physical activity, daily fruit and vegetable consumption, more preventive health check-ups, and less smoking. In contrast, alcohol consumption was more common among those with higher levels of education. All health behaviours and health literacy were correlated directly or indirectly (i.e., mediated by health) with well-being. These findings highlight the importance of examining indirect pathways linking education to well-being in later life. Interventions aimed at improving health literacy and promoting healthy behaviours may help reduce educational inequalities in quality of life among older adults.
Bui, L. V.; Nguyen, D. N.
Show abstract
Background. Vietnam's disease burden has shifted from communicable, maternal, neonatal, and nutritional (CMNN) causes to non-communicable diseases (NCDs), but the tempo, drivers, and regional positioning of this transition have not been jointly quantified. We characterised Vietnam's epidemiological transition 1990-2023 against ten Southeast-Asian (SEA) peers. Methods. Using Global Burden of Disease 2023 data, we computed joinpoint-regression AAPC with 95% CI (BIC-penalised, up to three break-points) for age-standardised DALY rates and cause-composition shares. We applied Das Gupta three-factor decomposition to 1990-2023 absolute DALY change (population-size, age-structure, age-specific-rate effects) and benchmarked Vietnam's NCD share against an SDI-conditional peer trajectory via leave-one-out quadratic regression. Premature mortality was quantified as WHO 30q70 under both broad NCD and strict SDG 3.4.1 definitions, using Chiang II life-table adjustment identically across all eleven countries. Findings. The CMNN age-standardised DALY rate fell from 13,295.9 to 4,022.1 per 100,000 (AAPC -4.63%/year; 95% CI -4.80 to -4.46); the NCD rate fell only from 21,688.2 to 19,282.8 (AAPC -0.37; -0.45 to -0.30). NCD share of total DALYs rose from 52.99% to 70.67% (+17.67 pp; AAPC +1.09). Vietnam ranked fourth of eleven SEA countries in 2023 (up from sixth in 1990) and sat 5.3% above the SDI-expected trajectory. Das Gupta decomposition attributed the +10.63 million NCD DALY increase to population growth (+6.26 M) and ageing (+6.08 M); rate change removed only 1.71 M. Premature NCD mortality fell from 25.02% to 21.80% (broad, 12.9% reduction) and from 22.17% to 19.50% (SDG 3.4.1, 12.0%; Vietnam sixth of eleven) - far short of the SDG 3.4 one-third-reduction target. Interpretation. Vietnam has entered a disability- and ageing-dominated NCD phase. Meeting SDG 3.4 by 2030 requires population-scale primary prevention sized to demographic momentum.
Reisberg, S.; Oja, M.; Mooses, K.; Tamm, S.; Sild, A.; Talvik, H.-A.; Laur, S.; Kolde, R.; Vilo, J.
Show abstract
Background: The increasing availability of routinely collected health data offers new opportunities for population-level research, yet access to comprehensive, linked, and standardised datasets remains limited. We describe EST-Health-30, a large-scale, population-representative health data resource from Estonia. Methods: EST-Health-30 comprises a random 30% sample of the Estonian population (~500,000 individuals), with longitudinal data from 2012 to 2024 and annual updates planned through 2026. Individual-level records are linked across five nationwide databases, including electronic health records, health insurance claims, prescription data, cancer registry, and cause of death records. A privacy-preserving hashing approach ensures consistent cohort inclusion over time while maintaining pseudonymisation. All data are harmonised to the Observational Medical Outcomes Partnership (OMOP) Common Data Model (version 5.4) using international standard vocabularies. Data quality was assessed using established OMOP-based validation frameworks. Results: The dataset contains rich multimodal information on diagnoses, procedures, laboratory measurements, prescriptions, free-text clinical notes, healthcare utilisation, and costs, with high population coverage and longitudinal depth. Data quality assessment showed high completeness and consistency, with 99.2% of applicable checks passing. The age-sex distribution closely reflects the national population, supporting representativeness, though coverage is marginally below the target 30% (29.2%), primarily attributable to recent immigrants without health system contact. The dataset enables construction of detailed clinical cohorts, analysis of disease trajectories, and evaluation of healthcare utilisation and outcomes across the life course. Conclusions: EST-Health-30 is a comprehensive, standardised, and population-representative real-world data resource that supports epidemiological, clinical, and methodological research. Its alignment with the OMOP CDM facilitates reproducible analytics and participation in international federated research networks, while secure access infrastructure ensures compliance with data protection regulations.
Schmidt, C.; Samartsidis, P.; Seaman, S.; Emmanouil, B.; Foster, G.; Reid, L.; Smith, S.; De Angelis, D.
Show abstract
To minimise health disparities, equitable access to medical treatment is paramount. In a pioneering intervention, National Health Service Englands Hepatitis C virus (HCV) programme has implemented country-wide peer support to boost treatment access. Peer support workers (peers) are individuals with relevant lived experience, who promote testing and treatment in marginalised populations underserved by traditional health services. We evaluated the English peers intervention, exploiting its staggered rollout and rich surveillance data between June 2016 and May 2021. Peers increased HCV cases identified by 13{middle dot}9% (95% credible interval (95% CrI) [5{middle dot}3, 21{middle dot}7]), sustained viral responses by 8{middle dot}0% (95% CrI [-4{middle dot}4, 18{middle dot}6]), and drug services referrals by 8{middle dot}8% (95% CrI [-12{middle dot}5, 22{middle dot}6]). The interventions effectiveness was magnified during the first COVID-19 lockdown and individuals supported by peers typically belonged to populations with poor treatment access. Our findings indicate that peers can boost equity in treatment access on a national scale.
Wang, J.; Morrison, J.
Show abstract
1Mendelian randomization (MR) uses genetic variants as instrumental variables to infer causal relationships between complex traits. Standard MR can be used to estimate an average causal effect at the population level, and typically assumes a linear exposure-outcome relationship. Recently, several methods for estimating nonlinear effects have been developed. However, many have been found to produce spurious empirical findings when subjected to negative control analyses. We propose that this poor performance may be attributable to heterogeneity in variant-exposure associations. We demonstrate that heterogeneous genetic effects on exposure lead to biased estimates, poor coverage, and inflated type I error in control function and stratification-based methods. In contrast, two-stage least squares (TSLS) methods are robust to such heterogeneity, but suffer from low precision and low power in some circumstances. We show that a statistical test for heterogeneity can be used to guide the choice of nonlinear MR methods. Using UK Biobank data, we reassess the causal effects of BMI, vitamin D, and alcohol consumption on blood pressure, lipid, C-reactive protein, and age (negative control). We find strong evidence of heterogeneity for all three exposures, and also recapitulate previous results that control function and stratification-based methods are prone to false positives. Finally, using nonparametric TSLS, we identify evidence of nonlinear causal effects of BMI on HDL cholesterol, triglycerides, and C-reactive protein; however, specific estimates of the shape of these relationships are imprecise. Altogether, our results suggest that common nonlinear MR methods are unreliable in the presence of realistic levels of heterogeneity, and that more methodological development is required before practically useful nonlinear MR is feasible.
Gantenberg, J. R.; La Joie, R.; Heston, M. B.; Ackley, S. F.
Show abstract
Qualitative models of Alzheimers pathology often posit that amyloid accumulation follows a sigmoid curve, indicating that the rate of deposition wanes over time. Longitudinal PET data now allow us to investigate amyloid accumulation trajectories with greater detail and over longer follow-up periods. We combine inferences from simulated amyloid trajectories, empirical PET data from the Alzheimers Disease Neuroimaging Initiative (ADNI), and the sampled iterative local approximation algorithm (SILA) to assess whether amyloid accumulation reaches a physiologic ceiling. We find that SILA reliably detects a ceiling, when present, across a range of simulated scenarios that impose a sigmoid shape. When fit to empirical data from ADNI, however, SILA does not appear to indicate the presence of a ceiling. Thus, we conclude that amyloid trajectories may not reach a physiologic ceiling during the stages of Alzheimers disease typically observed while patients remain under follow-up in cohort studies. Fits using SILA indicate that illustrative models of biomarker cascades, while useful tools for conceptualizing and interrogating pathologic processes, may not represent the shapes of amyloid trajectories accurately. Summary for General PublicAmyloid, a protein implicated in Alzheimers disease, is thought to reach a plateau in the brain, but methods that estimate how amyloid changes over time suggest it grows unabated. Gantenberg et al. use one such method and simulations to argue that amyloid does not reach a plateau during the typical course of Alzheimers.
Goryanin, I.; Damms, B.; Goryanin, I.
Show abstract
Background: Ageing is a systems level biological process underlying the onset and progression of multiple chronic disorders. Rather than arising from a single pathway, age related decline reflects interacting disturbances in metabolic regulation, inflammation, nutrient sensing, cellular stress responses, and tissue repair. Although GLP1 receptor agonists, sodium glucose cotransporter2 inhibitors, metformin, and rapamycin are usually evaluated against disease-specific endpoints. Objective: To develop an SBML compliant quantitative systems pharmacology model in which ageing is the primary pharmacological endpoint and to evaluate which combination therapy provides the greatest benefit for both metabolic and ageing related outcomes. Methods: We developed model comprising four layers: a metabolic/pharmacodynamic layer describing weight loss, HbA1c reduction, and nausea with tolerance; a drug layer capturing class-specific effects of GLP1 agonists, sodium glucose cotransporter2 inhibitors, metformin, and rapamycin; an ageing layer representing damage accumulation, repair capacity, frailty, and biological age gap; and a biomarker layer generating trajectories and estimated glucose disposal rate. Calibration was staged across semaglutide clinical endpoints. Bayesian hierarchical meta analysis, global sensitivity analysis, and practical identifiability analysis were used to assess robustness and interpretability. Results: The model reproduced semaglutide efficacy and tolerability dynamics and supported distinct drug-class profiles across metabolic and ageing axes. Rapamycin showed minimal glycaemic effect but emerged as a dominant driver of repair related ageing outcomes. Combination simulations predicted two distinct optima: one favouring metabolic improvement and one favouring ageing related benefit. Conclusion: The model supports the view that metabolic and ageing optimization are mechanistically distinct objectives and that weight loss and glycaemic improvement alone may be insufficient surrogates for health span benefit.
Van, T. A.
Show abstract
BackgroundType 2 diabetes mellitus (T2DM) is a leading global public health challenge. Machine learning (ML) combined with Explainable AI (XAI) is increasingly applied to T2DM risk prediction, but the field lacks a quantitative overview of methodological trends and integration gaps. MethodsWe present a structured synthesis and critical analysis of the XAI literature on T2DM risk prediction, combining (i) quantitative bibliometric analysis of a two-database corpus (N = 2,048 documents from Scopus and PubMed/MEDLINE, deduplicated via a transparent three-tier pipeline) and (ii) an in-depth selective review of 15 highly cited papers. Reporting follows PRISMA 2020, adapted for metadata-based synthesis; analyses include keyword frequency, rule-based thematic clustering, and publication trend analysis. ResultsThe field grew rapidly, from 36 documents (2020) to 866 (2025). SHAP and LIME dominate XAI methods; XGBoost and Random Forest dominate ML models. Critically, KG/GNN terms appeared in only 17 documents ([~]0.83%) compared with 906 for XAI methods, a 53.3:1 disparity. This gap is consistent across both databases, which share 33.2% of their records, ruling out a single-database artifact. The selective review confirmed that none of the 15 highly cited papers combined all three components, ML, XAI, and KG, in T2DM risk prediction. ConclusionsThe XAI for T2DM risk prediction field exhibits a clinical interpretability gap: statistical explanations are rarely linked to structured clinical pathways. We propose a three-layer conceptual framework (Predictive [->] Explainability [->] Knowledge) that integrates KG as a supplementary semantic layer, with potential applications in clinical decision support and population-level screening. The framework does not perform true causal inference but structures explanations around established pathophysiological knowledge. This study contributes a transferable methodology and a quantified research gap to guide future work integrating ML, XAI, and structured medical knowledge.
Li, Y.; Cabral, H.; Tripodis, Y.; Ma, J.; Levy, D.; Joehanes, R.; Liu, C.; Lee, J.
Show abstract
Mediation analysis quantifies how an exposure affects an outcome through an intermediate variable. We extend mediation analysis to capture the cumulative effects of longitudinal predictors on longitudinal outcomes. Our proposed model examines how mediators transmit the effects of the current and previous exposure on the current outcome. We construct a least-squared estimator for cumulative indirect effect (CIE) and used three approaches (exact form, delta method, and bootstrap procedure) to estimate its standard error (SE). The estimator of CIE is unbiased with no unmeasured confounding and independent model errors between mediator model and outcome model at all time points, as shown in statistical inference and in simulations. While three SE estimates are numerically similar, bootstrap procedure is recommended due to its simplicity in implementation. We apply this method to Framingham Heart Study offspring cohort to assess if DNA methylation mediates the association of alcohol consumption with systolic blood pressure over two time points. We identify two CpGs (cg05130679 and cg05465916) as mediators and construct a composite DNA methylation score from 11 CpGs, which mediates for 39% of the cumulative effect. In conclusion, we propose an unbiased estimator for CIE. Future studies will investigate the missingness in mediators and outcomes.
Haug, M.; Ilves, N.; Umov, N.; Loorents, H.; Suvalov, H.; Tamm, S.; Oja, M.; Reisberg, S.; Vilo, J.; Kolde, R.
Show abstract
Abstract Objective To address the unresolved bottleneck of selecting cohort-relevant clinical concepts for treatment trajectory analysis in observational health data, we introduce CohortContrast, an OMOP-compatible R package for enrichment-based concept identification, temporal and semantic noise reduction, and concept aggregation, enabling cohort-level characterization and downstream trajectory analysis. Materials and Methods We developed CohortContrast and applied it to OMOP-mapped observational data from the Estonian nationwide OPTIMA database, which includes all cases of lung, breast, and prostate cancer, focusing here on lung and prostate cancer cohorts. The workflow combines target-control statistical enrichment, temporal/global noise filtering, hierarchical concept aggregation and correlation-based merging, with optional patient clustering for downstream trajectory exploration. We validated the approach with a clinician-based plausibility assessment of extracted diagnosis-concept pairs and evaluated a large language model (LLM) as an auxiliary filtering step. Results We analyzed 7,579 lung cancer and 11,547 prostate cancer patients. The workflow reduced concept dimensionality from 5,793 to 296 concepts (94.9%) in lung cancer and from 5,759 to 170 concepts (97.0%) in prostate cancer, and identified three exploratory patient subgroups in both cohorts. In a plausibility assessment of 466 diagnosis-concept pairs, validators rated 31.3% as directly linked and 57.5% as indirectly linked. Discussion CohortContrast reduces manual concept curation by prioritizing and aggregating cohort-relevant concepts while preserving clinically interpretable treatment patterns in OMOP-based real-world data. Conclusion CohortContrast enables scalable reduction of broad OMOP concept spaces into clinically interpretable, cohort-specific representations for exploratory trajectory analysis and real-world evidence research.
Robert, A.; Goodfellow, L.; Pellis, L.; van Leeuwen, E.; Edmunds, W. J.; Quilty, B. J.; van Zandvoort, K.; Eggo, R. M.
Show abstract
BackgroundIn England, the burden of respiratory infections varies by ethnicity, contributing to health inequalities, but the role of additional demographic factors remains underexplored. We quantified how differences in social mixing and demographic characteristics between ethnic groups cause inequalities in transmission dynamics. MethodsWe analysed the association between the ethnicity and the number of contacts of 12,484 participants in the 2024-2025 Reconnect social contact survey, using a negative binomial regression model. We simulated respiratory pathogen epidemics using a compartmental model stratified by age, ethnicity, and contact levels, at a national level and in major cities in England. FindingsAfter adjusting for demographic variables, participants of Black and Mixed ethnicities had more contacts than those of White ethnicity (rate ratios (RR): 1.18 [95% Credible Interval (CI): 1.11-1.26], and 1.31 [95% CI: 1.14-1.52]). Participants of Asian ethnicity had fewer contacts (RR: 0.85 [95% CI: 0.79-0.91]). In national-level simulations, individuals of White ethnicity had the lowest attack rates due to demographic differences and mixing patterns. Local demographic structures changed simulated dynamics: attack rates in individuals of Black and Mixed ethnicities were approximately double those of White ethnicity in Birmingham, but less than 60% higher in Liverpool. InterpretationDemographic characteristics and mixing patterns create inequalities in transmission dynamics between ethnicities, while local demographic characteristics and pathogen infectiousness change the expected relative burden. To ensure mitigation strategies are effective and equitable, their evaluation must explicitly account for inequalities arising from local context. FundingMedical Research Council, National Institute for Health and Care Research, Wellcome Trust Research in context Evidence before this studyWe searched PubMed for population-based studies quantifying differences in respiratory infections between ethnic groups, up to 1 April 2026, with no language restrictions. Keywords included: (respiratory pathogens OR influenza OR COVID-19) AND (ethnic* OR race) AND (inequ*) AND (compartmental model OR incidence rate ratio OR hazard ratio). We excluded studies that focused on non-respiratory pathogens (e.g. looking at consequences of COVID-19 on incidence of other pathogens). A population-based cohort study showed that influenza infection risk was higher in South Asian, Black, and Mixed ethnic groups compared to White ethnicity in England. Another population-based cohort study highlighted that during the first wave of COVID-19 in England, the South Asian, Black, and Mixed ethnic groups were more likely to test positive and to be hospitalised than the White ethnic group. Census data in England showed that the distributions of age, household size, household income and employment status differed between ethnic groups, and the recent Reconnect social contact surveys highlighted the impact of each demographic factor on the participants number of contacts. Added value of this studyOur study shows that social contact patterns, mixing, and demographic structure all lead to unequal infection risk between ethnic groups in respiratory pathogen epidemics. Using the largest available social contact survey in England, we show that both the average number of contacts and the proportion of high-contact individuals varied by ethnic group, even after adjusting for participants demographics. These differences, together with mixing patterns and age structure, led to lower expected incidence among individuals of White ethnicity than in all other ethnic groups in simulated outbreaks. The level of inequality between ethnic groups changed when we used different values of pathogen transmissibility. Finally, as ethnic composition and population structure differ between cities in England, our results show differences in expected inequalities at a local level. Implications of all the available evidenceInequalities in infection risk between ethnic groups are context- and pathogen-dependent. They arise from both local population structure and contact patterns. Detailed information on mixing between groups and population structure is needed to accurately measure group-specific infection risk. These findings indicate that public health interventions based only on national-level estimates conceal regional variation in risk and may ultimately increase inequalities. Public health interventions need to be tailored to local contexts to be equitable and effective. Finally, our findings provide a foundation for understanding the progression from infection-risk inequalities to disparities in disease presentation and clinical outcomes.
Lin, T.; Li, Y.; Huang, Z.; Gui, T. T.; Wang, W.; Guo, Y.
Show abstract
Target trial emulation (TTE) offers a principled way to estimate treatment effects using real-world observational data, but analyses of time-varying treatment strategies remain vulnerable to immortal time bias. The clone-censor-weight (CCW) approach is increasingly used to address this problem, yet key aspects of its causal interpretation and implementation remain unclear. In this work, we emulate a target trial using electronic health records (EHRs) to compare completion of a 3-dose 9-valent human papillomavirus vaccination (HPV) series within 12 months versus remaining partially vaccinated among vaccine initiators. We link CCW to the classic potential outcome framework in causal inference, evaluate the role of different weighting mechanisms, and account for within-subject correlation induced by cloning using cluster-robust variance estimation. Our study provides practical guidance for applying CCW in real-world comparative effectiveness studies to address immortal time bias and supports more rigorous and interpretable treatment effect estimation in TTE.
Alfaro, H. E.; Lara-Arevalo, J.
Show abstract
Ambulatory Care Sensitive Conditions (ACSCs) are conditions for which effective and timely primary health care (PHC) can prevent hospitalizations. They are widely used as a proxy indicator of access to and quality of PHC. Despite their relevance, evidence from Central America remains scarce. This study aimed to quantify the burden, describe the epidemiological profile, and assess temporal trends of ACSCs hospitalizations in Honduras from 2014 to 2024. We conducted a retrospective observational study using national administrative hospital discharge data from all Ministry of Health hospitals. ACSCs were defined using a standardized list of 20 diagnostic groups based on ICD-10 codes. We estimated percentages and sex-age-standardized hospitalization rates per 10,000 inhabitants. Clinical indicators included length of stay (LOS) and in-hospital fatality rates. Temporal trends were evaluated using joinpoint regression models to estimate annual percent changes (APC). Analyses included stratification by age, sex, and disease category. A total of 4,023,944 hospitalizations were analyzed, of which 547,486 (13.6%) were classified as ACSCs. The overall sex-age-standardized rate was 54.1 per 10,000 inhabitants. ACSCs' standardized rates increased between 2014 and 2018 (APC: 2.7%; 95% CI: -2.4; 15.2), declined sharply between 2018 and 2021 (APC: -17.8%; 95% CI: -30.6; -10.3), and increased again between 2021 and 2024 (APC: 15.9%; 95% CI: 4.6; 37.6). Despite this rebound, rates remained below pre-pandemic levels. ACSCs were concentrated among children under 5 years (27.7%) and adults aged 60 years and older (29.9%). Noncommunicable diseases accounted for 56.8% of cases, with diabetes mellitus as the leading cause. Compared with non-ACSCs hospitalizations, ACSCs were associated with longer LOS (4.9 vs. 3.9 days; p <0.001) and higher in-hospital fatality rates (2.4% vs. 1.7%; p <0.001). ACSCs hospitalizations constitute a substantial burden in Honduras and reflect persistent gaps in PHC performance. Strengthening PHC resilience and capacity, particularly for chronic disease management and vulnerable populations, is essential to reduce avoidable hospitalizations and improve health system efficiency and equity.
Staples, J. W.; White, S. L.; Giacalone, A.; Pozdeyev, N.; Sammel, M. D.; Stranger, B. E.; Valencia, C. I.; Santoro, N.; Hendricks, A. E.
Show abstract
Objective. Menopause is a significant physiological transition with implications for health outcomes (e.g., cardiometabolic), yet gaps remain in understanding the menopause transition, including how menopause timing and type influence health outcomes. Large-scale cohort studies in midlife (age~40-60) females, including the All of Us Research Program (AoURP), provide opportunities to study menopause across diverse populations and data modalities. We characterized menopause-related data in AoURP, focusing on age distributions and concordance between EHR diagnosis codes and self-reported survey responses. Methods. We analyzed menopause-related survey, EHR diagnostic code, and genomic data among ~396,000 participants in AoURP with female sex. We summarized menopause data across modalities, overlap between survey, EHR, and genomic data, and age distributions overall and across sociodemographic characteristics. Results. Among ~396,000 females, surveys captured ~193,000 menopause observations, nearly seven times more than structured EHR diagnoses (~28,000), suggesting under- ascertainement in EHR data. Nearly all females (~99%) with an EHR menopause diagnosis also reported menopause in the survey. Approximately 22,000 participants had intersected EHR, survey, and genomic menopause-related data. Survey-based age patterns matched expectations, with participants <40 years predominantly reporting pre-menopausal status and those >60 years predominantly reporting post-menopausal status. A small subset (N{approx}1,700; 4%) (age>70 years) reported no menopause, suggesting response or recall bias. EHR menopause codes were concentrated after age>45 years, with a notable spike at age 65. Modest differences in survey-based menopause age distributions were observed by sociodemographic characteristics (e.g., race, ancestry). Conclusions. These findings inform sampling strategies, power calculations, phenotype definition, and study design for menopause research using AoURP.
Luo, M.; Trindade Pons, V.; Zakharin, M.; Pingault, J.-B.; Gillespie, N. A.; van Loo, H. M.
Show abstract
Substance use disorders run in families, yet the mechanisms underlying intergenerational transmission remain unclear. We investigated indirect genetic effects, pathways through which parental genotypes influence offspring phenotypes via the family environment, for alcohol use disorder (AUD), nicotine dependence (ND), and related quantitative outcomes, and aimed to identify family environmental factors through which such effects may operate. Using transmitted and non-transmitted polygenic scores (PGS) constructed for problematic alcohol use, tobacco use disorder, and general addiction liability, we analyzed 5972 European-ancestry adult offspring with at least one genotyped parent from the population-based Lifelines cohort (Netherlands). Offspring outcomes included lifetime DSM-5 AUD diagnosis, AUD symptom count, maximum drinks in 24 hours, Fagerstrom Test for Nicotine Dependence score, and cigarettes per day. AUD findings were meta-analyzed with data from the Brisbane Longitudinal Twin Study (N = 1368; Australia). We also examined parent-of-origin effects and mediation by parental substance use and socioeconomic status using structural equation modeling. Transmitted PGS robustly predicted all AUD and ND outcomes ({beta} = 0.07-0.16; OR = 1.20 for AUD diagnosis). Non-transmitted PGS, indexing indirect genetic effects, were negligible for all clinical syndrome outcomes. The only significant indirect genetic effect was on cigarettes per day ({beta} = 0.03, p = 0.01), mediated by parental smoking behavior but not socioeconomic status. These findings indicate that intergenerational transmission of risk for AUD and ND is driven primarily by direct genetic effects, with modest indirect genetic effects on smoking quantity. Larger samples and cross-trait analyses are needed to further elucidate these mechanisms.
Ekenze, O.; Scott, M. R.; Himali, D.; Lioutas, V.-A.; Seshadri, S.; Howard, V. J.; Fornage, M.; Aparicio, H. J.; Beiser, A. S.; Romero, J. R.
Show abstract
Sex specific differences in stroke are recognized. Whether differences in incident stroke risk persists in recent periods needs further elucidation to aid public health preventive efforts. Aim: To determine long-term sex specific trends in stroke and stroke risk factors at different epochs among Framingham Heart Study participants. Methods: We examined age-adjusted 10-year stroke incidence using Cox regression in women and men in five epochs: 1962-1969 (epoch 1, reference), 1971-1976 (epoch 2), 1987-1991 (epoch 3), 1998-2005 (epoch 4), 2015-2021 (epoch 5). We compared stroke incidence by sex across epochs, estimated decade-wise linear trends overall and by sex. We compared risk factors in successive epochs to the first, and estimated sex-specific trends in risk factors. Interactions between baseline risk factors with epoch and trends were assessed by sex. Secondary analyses were repeated in participants <60 years old. Results: Incident stroke occurred in 4.5% (178/3996) in epoch 1, 3.9% (227/5786) in epoch 2, 3.9% (199/5137) in epoch 3, 2.7% (207/7642) in epoch 4, 2.2% (119/5534) in epoch 5. Men had higher risk of incident stroke in each epoch with significant difference in epochs 2 (HR 1.41, 95% CI [1.08, 1.84]) and 4 (HR 1.46, 95% CI [1.11, 1.91]) overall, and in epoch 4 (HR 2.13, 95% CI [1.17, 3.87]) among those <60 years. Stroke incidence declined by 16% per decade in men (HR 0.84, 95% CI [0.79, 0.89]) and 19% per decade in women (HR 0.81, 95% CI [0.76, 0.86]). Among those <60 years, stroke incidence declined by 22% per decade in women (HR 0.78, 95% CI [0.67, 0.95]). Hypertension declined by 8% per decade in women only ([OR] 0.92, 95% CI [0.90, 0.94]), while Atrial fibrillation and diabetes increased in both. Conclusion: Stroke incidence continues to decline in recent periods for women and men. Among participants <60 years, decline was observed only in women, possibly related to decline in hypertension in women.
Ouedraogo, F. A. S.
Show abstract
Despite the evolution of epidemiological analysis and modeling tools, difficulties still remain, especially in developing countries, regarding the availability and use of these tools. Often expensive, requiring high technical expertise, demanding constant connectivity of several or sometimes even significant resources, these tools, although efficient, present a major gap with the operational realities of health districts. It is in this context that we introduce Episia, an open-source Python library designed and conceived to provide a framework to facilitate epidemiological analysis and modeling. It integrates a suite of compartmental epidemic models (SIR, SEIR, SEIRD) with a sensitivity analysis using the Monte Carlo method, a complete biostatistics suite validated against the OpenEpi reference standard, as well as a native DHIS2 client for automated data ingestion. Developed in Burkina Faso, it is optimized and aims not only to address these health challenges encountered in Africa but also remains a versatile tool for global health informatics.
Martin, C. M.; henderson, i.; Campbell, D.; Stockman, K.
Show abstract
Background: The instability-plasticity framework proposes that multimorbidity trajectories periodically enter instability phases that are vulnerable to escalation but also potentially modifiable through relational intervention. Whether such phases commonly resolve without acute care, or predominantly progress to hospitalisation, has not been quantified at scale. Objective: To quantify instability window outcomes across a longitudinal monitoring cohort; to test whether the characteristics distinguishing admitted from resolved windows reflect within-patient trajectory dynamics or between-patient severity; and to characterise which patient-reported and operator-rated signals reliably precede admission, using both a curated pilot sub-cohort and the full monitoring cohort with an explicit cross-cohort comparison. Methods: Two complementary analyses were conducted on data from the MonashWatch Patient Journey Record (PaJR) relational telehealth system. Instability windows were identified algorithmically (>=2 consecutive calls with Total_Alerts >=3) across the full longitudinal dataset (16,383 calls, 244 patients, 2.5 years) and classified by linkage to ED and hospital admission data. Window characteristics were compared at window, patient, and paired within-patient levels. Pre-admission signal cascades were analysed in two configurations: a curated pilot sub-cohort (64 patients, 280 calls, +/-10-day window, 103 admissions, December 2016-September 2017) and the full monitoring cohort (175 patients, 1,180 pre-admission calls, +/-14-day window, December 2016-July 2019). A three-way cross-cohort comparison decomposed differences between the two configurations into pipeline and population effects. Results: 621 instability windows were identified across 157 patients (64% of the monitored cohort). 67.3% resolved without hospital admission or ED attendance, a rate stable across alert thresholds 1-5. In paired within-patient analysis (n = 70), duration in days (p = 0.002) and multi-domain breadth (p < 0.001) distinguished admitted from resolved windows; alert intensity did not. In the pilot sub-cohort, patient-reported illness prognosis (Q21) was the dominant pre-admission signal (GEE beta = +0.058, AUC = 0.647, p-BH = 0.018). This finding did not replicate in the full cohort: Q21 was non-significant (GEE beta = -0.008, p = 0.154, AUC = 0.507). Cross-cohort analysis identified selective curation of the pilot sub-cohort as the primary explanation. In the full cohort, six signals escalated significantly before admission after Benjamini-Hochberg correction: total alerts, health impairment (Q26), red alerts, self-rated health (Q3), patient concerns (Q1), and operator concern (Q34). Health impairment achieved the highest individual AUC (0.605) and showed the longest pre-admission lead. No individual signal exceeded AUC 0.61. Conclusions: Two thirds of instability phases resolve without hospitalisation, providing direct empirical support for trajectory plasticity as a clinically frequent phenomenon. Within the same patient, persistence - in duration and in the consistency of high-severity multi-domain flagging across calls - distinguishes trajectories that tip into admission from those that resolve. The Q21 signal reversal between cohorts illustrates how selective curation can produce compelling but non-replicable findings in monitoring research. In the full population, objective alert signals and operator judgement, rather than patient illness prognosis, carry the pre-admission signal
Nilsson, A.; da Silva, M.; Le, H. T.; Haggstrom, C.; Wahlstrom, J.; Michaelsson, K.; Trolle Lagerros, Y.; Sandin, S.; Magnusson, P. K.; Fritz, J.; Stocks, T.
Show abstract
Excess body weight has been associated with increased cancer risk, but the role of weight change across adulthood remains unclear. We examined body weight trajectories from ages 17 to 60 and their associations with site-specific cancer incidence. Data were based on the ODDS study, a pooled, nationwide cohort study in Sweden, with data on weight spanning 1911 to 2020, and cancer follow-up through 2023. Weight trajectories were estimated with linear mixed effects models in individuals with at least three weight measurements. Cox regressions estimated hazard ratios for associations between weight trajectories and established and potentially obesity-related cancers. Fifth versus first quintile of weight change was associated with many cancers, most strongly with esophageal adenocarcinoma in men (HR 2.25; 95% CI 1.66-3.04), liver cancer in men (HR 2.67; 95% CI 2.15-3.33), endometrial cancer in women (HR 3.78; 95% CI 3.09-4.61), and pituitary tumors in both sexes (men: HR 3.13 [95% CI 2.13-4.61]; women: HR 2.13 [95% CI 1.41-3.22]). Associations varied by sex and age. Heavier weight at age 17 years and earlier obesity onset were also associated with higher cancer incidence. These findings highlight the importance of a life-course approach to weight management and support sex- and age-targeted cancer prevention strategies.
Abeysooriya, M. D.; Hiam, D.; Voisin, S.; Eynon, N.; Ziemann, M.; Lamon, S.
Show abstract
BackgroundAgeing is a sex-specific process characterised by a progressive decline in physiological integrity. DNA methylation represents a primary epigenetic hallmark of ageing, yet sex-specific patterns of epigenetic ageing within and across tissues remain poorly understood. This study aims to address these gaps through an integrated analysis of sex-moderated epigenetic ageing across eight human tissues. MethodsA total of 137 DNA methylation datasets comprising over 36,000 individuals aged 10-114 years were analysed using a meta-analytic workflow to identify age-associated differentially methylated positions (aDMPs) and regions (aDMRs), meta-regression to assess sex moderation, and pathway enrichment analyses to interpret functional relevance. FindingsIndividual tissues displayed distinct age-related methylation trajectories, but some DMP sites showed consistent hyper- or hypomethylation across tissues. Across tissues, we identified 68,630 aDMPs (10%) robustly associated with ageing. Age-associated changes at the regional level were less common, with only 80 robust age-associated aDMRs detected across tissues, representing 0.09% of analysed regions. Sex moderation was observed for only 16 aDMPs (0.002%), indicating that sex effects on age-associated DNA methylation are largely tissue-specific rather than shared across tissues. InterpretationOur findings indicate that age-associated DNA methylation changes predominantly occur at isolated CpG sites rather than extended genomic regions and are strongly dependent on tissue and genomic context. The minimal overlap of sex-moderated methylation signals across tissues suggests that age-related sex differences at the epigenetic level are more likely attributable to tissue- and cell-type-specific variation rather than to broadly conserved epigenetic mechanisms shared across tissues. FundingThis study was funded by an Australian Research Council (ARC) Discovery project (DP200101830). Severine Lamon was funded by an ARC Future Fellowship (FT210100278). Nir Eynon was funded by NHMRC Investigator Grant (APP1194159), and a Hevolution/AFAR New Investigator Award in Aging Biology and Geroscience Research. Mandhri D. Abeysooryia was supported by an Australian Government Research Training Program (RTP) Scholarship. Research in context Evidence before this studyDNA methylation is widely recognised as a central epigenetic hallmark of ageing. Previous research has demonstrated that some age-related methylation changes are conserved across tissues, forming the basis of pan-tissue epigenetic clocks. Most studies to date have primarily examined age effects in isolation. Although biological sex influences ageing trajectories and susceptibility to nearly all age-related diseases, sex-moderated epigenetic ageing has received limited investigation. Specifically, pan-tissue clocks, including GrimAge and PhenoAge, are "sex-aware" but were trained and validated in mixed-sex cohorts, limiting their capacity to disentangle tissue-specific sex effects. Consequently, it remains unclear whether sex-moderated epigenetic ageing signals are shared across tissues or are tissue-specific. Added value of this studyThis study provides a large-scale, comprehensive multi-tissue analysis of sex-moderated epigenetic ageing, integrating 137 DNA methylation datasets across eight human tissues and more than 36,000 male and female individuals spanning the lifespan. Our findings show that while age-associated methylation changes are widespread at the CpG level, sex-moderated effects are rare and largely tissue-specific, with minimal overlap across tissues. Implications of all the available evidenceTogether, the available evidence indicates that epigenetic ageing is predominantly driven by shared, conserved age-related methylation changes, whereas sex differences in epigenetic ageing are modest and context dependent. These sex-related effects are more likely to reflect tissue- and cell-type-specific variation rather than widespread, shared mechanisms. This underscores the need to develop sex-specific epigenetic clocks and to conduct longitudinal cohort and intervention studies to more precisely characterise sex-specific dynamics of epigenetic ageing across tissues.